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This  report  summarizes  the  research  conducted) under  Contract  F4062O82- 


C-0082  during  the  period  May  1083  -  April  1085.  20  technical  reports  were 
issued  under  the  contract  during  that  period.  Most/of  these  reports  have  resulted 


in,  or  are  expected  to  result  in,  published  papers.  ^Abstracts  of  4he  reports  are 

given  below  in  chronological  order.  (Numbers  in  brackets  refer  to  these 

^  T/i /y 

abstracts.)  The  reports  deal^rith  the  following  topics: 


r  ' 

a)  Parallel  algorithms  and  architectures  for  processing  images  [1,6,17,18,10]  and 

data  structures  derived  from  imagesr{2, 14,16]  (see  also  [3]  on  image  process- 

J) 

ing  software) 

b)  Interprocessor  communication  for  parallel  processingj[5, 10, 11,15] 

c)  Knowledge-based  image  understanding  *[4,13] 


4^  d)  Image  modeling  ((texture,  shape)  [7,8,0,12420] 


Research  is  continuing\in  all  of  these  areas  under  a  successor  contract. 


ABSTRACTS  OF  TECHNICAL  REPORTS 


1.  Roger  Eastman,  “Reliability  in  Cellular  Arrays  Through  Data  Sharing", 
CS-TR-1285,  CAR-TR-7,  May  1083. 

This  paper  describes  several  methods  for  increasing  cellular  array  reliability  for 
Image  processing  through  redundancy,  with  emphasis  on  error  masking  methods 
that  preserve  continuous  operation  of  the  array.  We  define  n -module  data  shar¬ 
ing  (NMDS)  for  a  cellular  array,  which  achieves  redundancy  by  sharing  data  and 
computations  between  adjacent  processors  in  the  array.  The  time,  space  and  reli¬ 
ability  performance  of  NMDS  is  compared  to  standard  n -module  redundancy 
(NMR). 

2.  Simon  Kasif  and  Reinhard  Klette,  “A  Data  Allocation  Problem  for  SIMD 
Systems",  CS-TR-1292,  CAR-TR-11,  June  1083. 

Let  S  «=  {Sj  ,5],...,  Sff)  be  a  set  of  data  objects,  and  /  be  function  from 
S  X  S  into  S .  For  T  Q  {1,2 ,  ,  N  }*  the  operation  of  Tf  applies  /  to 

every  element  of  {  ($,-  ,  Sj )  |  («' ,  j )  €  T  }  C  S  X  S.  For  the  optimal  computa¬ 
tion  of  Tf  on  a  SIMD  system  with  p  processors,  the  data  allocation  problem  con¬ 
sists  in  distributing  objects  to  the  p  processors  such  that  the  computation  and 
data  transfer  time  is  minimised.  In  the  paper,  the  cases  T  ■*  {1,  2 ,  .  .  .  ,  N  }2 
and  T  <  •  <  N)  are  dealt  with. 

3.  Reinhard  Klette,  Klaus  Voss,  and  Peter  Hufnagl,  “Interactive  Software  Sys¬ 
tems  for  Computer  Vision",  CS-TR-1293,  CAR-TR-12,  June  1083. 

This  paper  gives  an  overview  of  the  main  directions  of  program  system  develop¬ 
ment  in  computer  virion,  where  special  emphasis  is  paid  to  interactivity.  A  sys¬ 
tematic  representation  of  interactive  computer  virion  systems  is  given.  Brief 
information  about  30  different  software  systems  is  listed  in  an  Appendix. 

4.  Vincent  Shang-Shouq  Hwang,  Takashi  Matsuyama,  Larry  S.  Davis,  and 
Asriel  Rosenfeld,  “Evidence  Accumulation  for  Spatial  Reasoning  in  Aerial 
Image  Understanding",  CS-TR-1338,  CAR-TR-28,  October  1083. 

We  describe  a  control  structure  for  building  an  Image  Understanding  System. 
This  system  can  deal  with  objects  with  diverse  appearances  when  consistent  spar 
tial  relations  exist  between  objects.  By  accumulating  consistent  predictions  ori¬ 
ginated  by  existing  Instances,  our  system  can  dynamically  reason  about  what  to 
do  in  order  to  construct  interpretations  of  the  image.  In  this  paper,  we  have  dis¬ 
cussed  parts  of  the  proposed  system  —  the  representation  of  spatial  knowledge, 
the  accumulation  of  evidence,  the  focus  of  attention  mechanism,  and  the  integra¬ 
tion  of  constraints  for  top-down  control. 


5.  LV.  Ramakrishnan,  DJS.  Fussell,  and  A.  SUberschatz,  “On  Mapping  Homo¬ 
geneous  Graphs  on  A  Linear  Array-Processor  Model”,  CS-TR-1330, 
CAR-TR-30,  October  1083. 

This  paper  presents  a  formal  model  of  linear  array  processors  suitable  for  VLSI 
implementation  as  well  as  graph  representation  of  programs  suitable  for  execution 
on  such  a  model.  A  distinction  is  made  between  correct  mapping  and  correct 
execution  of  such  graphs  on  this  model  and  the  structure  of  correctly  mappable 
graphs  is  examined.  The  formalism  developed  is  used  to  synthesise  algorithms 
for  this  model. 

6.  I.V.  Ramakrishnan,  “Modular  Matrix  Multiplication  On  A  Linear  Array” 
CS-TR-1340,  CAR-TR-31,  November  1083. 

A  matrix-multiplication  algorithm  on  a  linear  array  using  an  optimal  number  of 
processing  elements  is  proposed.  The  local  storage  required  by  the  processing  ele¬ 
ments  and  the  I/O  bandwidth  required  to  drive  the  array  are  both  constants  that* 
are  independent  of  the  sizes  of  the  matrices  being  multiplied.  The  algorithm  is 
therefore  modular,  that  is,  arbitrarily  large  matrices  can  be  multiplied  on  a  large 
array  built  by  cascading  small  arrays.  The  array  is  well-suited  for  VLSI  imple¬ 
mentation. 

7.  Songde  Ma  and  Andrd  Gagalowicz,  “A  Parallel  Method  for  Natural  Texture 
Synthesis”,  CS-TR-1343  ,  CAR-TR-32,  November  1983. 

This  paper  deals  with  an  optimization  technique  applied  to  natural  texture  syn¬ 
thesis.  We  propose  a  definition  of  a  global  criterion  which  is  the  mean  square 
error  between  the  statistical  features  of  a  natural  original  texture  and  those  of  an 
artificially  generated  one.  A  gradient  algorithm  is  used  to  minimise  this  criterion. 
The  statistical  feature  vector  used  was  the  autocorrelation  function  although  this 
is  by  no  means  the  only  choice.  The  textures  generated  are  very  similar  to  the 
original  ones.  This  method  can  be  implemented  in  a  highly  parallel  manner. 

8.  Andrd  Gagalowicz  and  Song  De  Ma,“Synthesb  of  Natural  Textures  on  3-D 
Surfaces”,  C&-TR-1344,  CAR-TR-33,  November  1983. 

This  paper  presents  a  new  method  for  the  synthesis  of  textures  on  3-D  surfaces. 
To  our  knowledge,  one  basic  technique  has  been  presented  up  to  now  in  the 
literature  (see  (7-18)).  In  this  standard  method,  textures  are  synthesised  by  map¬ 
ping  a  rectangular  template  onto  the  curved  surface.  This  method  is  complex, 
requires  substantial  computing  time,  and  presents  some  drawbacks  such  as  the 
possibility  of  obtaining  aliasing  effects  and  continuity  problems  along  the  edges  of 
the  curved  templates.  Procedures  to  eliminate  these  problems  are  available 
(11,12)  but  make  this  synthesis  even  more  unattractive.  The  method  proposed  in 
this  paper  does  not  present  the  former  drawbacks.  We  do  not  use  a  template 


mapping,  -which  Is  a  drawback  in  itself.  The  synthesis  is  achieved  continuously 
on  the  surface,  so  that  there  are  no  edge  effects  and  also  no  aliasing  effects.  This 
method  is  a  simple  extension  of  a  procedure  that  we  have  proposed  before  in  the 
literature  [2,3]  for  planar  textures.  Any  kind  of  texture  can  be  reproduced  with  a 
good  similarity  to  the  reference  texture  used.  It  also  has  the  important  advan¬ 
tage  that  only  one  set  of  second  order  statistics  (a  small  amount  of  data)  needs  to 
be  computed  on  a  planar  version  of  the  reference  texture  to  synthesise  this  tex¬ 
ture  on  any  surface  and  at  any  distance.  Some  results  on  simple  surfaces  are 
displayed  (cylinder,  sphere),  but  the  method  holds  for  any  surface  and  is  rela¬ 
tively  quick  and  easy. 

0.  Andr<f  Gagalowicz  and  Song  De  Ma,  “Sequential  Synthesis  of  Natural  Tex¬ 
tures”,  CS-TR-1345,  CAR-TR-34,  November  1083. 

A  new  method  for  the  generation  of  natural  textures  is  presented.  Using  a  priori 
given  second  order  statistics  (second  order  spatial  averages  or  autocorrelation 
parameters)  of  a  natural  texture  as  input,  we  give  a  procedure  to  synthesise  art* 
artificial  texture  field  in  such  a  way  that  its  second  order  statistics  are  equal  to 
the  desired  ones.  The  synthesis  is  achieved  directly  without  inventing  higher 
order  statistics,  as  was  the  ease  in  earlier  publications  [3-7,15].  This  method 
allows  us  to  synthesize  gray  tone  texture  fields  while  “controlling”  their  second 
order  statistics  in  rather  large  neighborhoods.  The  synthesized  textures  are  very 
similar  visually  to  the  original  natural  textures  used  to  compute  the  second  order 
statistics,  but  second  order  spatial  averages  give  better  results  than  autocorrela¬ 
tion  parameters.  This  seems  to  strongly  support  the  conjecture  [5,8,10]  that  the 
visual  system  is  only  sensitive  to  the  second  order  spatial  averages  of  a  given  tex¬ 
ture  field,  so  that  these  statistics  should  be  used  to  model  textures. 

10.  Angela  Y.  Wu,  “Embedding  of  Networks  of  Processors  Into  Hypercubes”, 
CS-TR-1354,  CAR-TR-35,  December  1083. 

The  hypercube  is  a  good  host  graph  for  the  embedding  of  networks  of  processors 
because  of  its  km  degree  and  low  diameter.  Graphs  such  as  trees  and  arrays  can 
be  embedded  into  a  hypercube  with  small  dilation  and  expansion  costs,  but  there 
are  classes  of  graphs  which  can  be  embedded  into  a  hypercube  only  with  large 
expansion  cost  or  large  dilation  cost. 

11.  LV.  Ramakrishnan  and  P.J.  Varman,  “On  Mapping  Cube  Graphs  on  VLSI 
Array  and  Tree  Architectures”,  CS-TR-1S58,  CAR-TR-41,  December  1083. 

We  formalise  a  model  of  array  architectures  suitable  for  VLSI  implementation.  A 
formal  model  of  an  arbitrarily  structured  tree  machine  is  also  presented.  A 
mathematical  framework  is  developed  to  transform  cube  graphs,  which  are  data¬ 
flow  descriptions  of  certain  matrix  computations,  onto  the  array  and  tree  models. 
All  published  algorithms  for  these  computations  ran  be  obtained  using  the 


mathematical  framework.  In  addition,  novel  linear-array  algorithms  for  matrix 
multiplication  are  obtained.  More  importantly,  the  algorithms  obtained  for  the 
tree  model  are  of  special  significance.  Besides  their  novelty,  the  independence  of 
the  tree  algorithms  from  a  specific  inter-processor  communication  geometry  mane 
them  robust  to  hardware  faults  as  opposed  to  algorithms  that  are  based  on 
specific  interconnection  requirements. 

12.  Axriel  Rosenfeld  and  Reinhard  Klette,  “Degree  of  Adjacency  or  Surrounded- 
ness”,  CS-TR-1380,  CAR-TR-53,  March  1984. 

Definitions  of  the  degree  of  adjacency  of  two  regions  in  the  plane,  and  the  degree 
of  surroundedness  of  one  region  by  another,  are  proposed.  Some  elementary  pro¬ 
perties  of  these  concepts  are  established,  and  it  is  also  shown  that  they  have 
natural  generalizations  to  fuzzy  subsets  of  the  plane.  Applications  of  the  pro¬ 
posed  measures  to  digital  polygons  are  demonstrated  and  fast  algorithms  for  com¬ 
puting  these  measures  are  given. 

13.  Takashi  Matsuyama,  Vincent  Shang-Shouq  Hwang,  and  Larry  S.  Davis, 
"Evidence  Accumulation  for  Spatial  Reasoning",  CS-TR-1381, 
CAR-TR-54,  March  1984. 

This  paper  describes  the  evidence  accumulation  process  of  an  image  understand¬ 
ing  system  first  described  in  [l],  which  enables  the  system  to  perform  top-down 
(goal-oriented)  picture  processing  as  well  as  bottom-up  verification  of  consistent 
spatial  relations  among  objects. 

14.  LV.  Ramakrishnaa,  "Dynamic  Programming  and  Transitive  Closure  on 
Linear  Pipelines”,  CS-TR-1388,  CAR-TR-87,  May  1984. 


Algorithms  for  the  dynamic  programming  and  transitive  closure  problems  are 
presented  for  a  linear  pipeline  of  processors-  These  algorithms  require  only  a  con¬ 
stant  number  of  I/O  ports  and  are  optimelin  their  area  and  time  requirements. 


18. 


SJC  Bhaskar,  Axriel  Rosenfeld  and  Angela  Y.  Wu,  "Simulation  of  Large 
Networks  of  Processors  By  Smaller  Ones",  CS-TR-1401,  CAR-TR-63,  May 
1984. 


This  paper  considers  the  problem  of  simulating  a  large  network  N  of  processors 
using  a  small  set  of  p  processors.  The  approach  taken  is  to  partition  the  nodes 


of  N  into  f  subsets  Nx , 


AT.  and  to  assign  each  subset  to  a  processor  for 


simulation.  In  order  to  equalise  the  workloads  of  the  processors,  the  sizes  of 


N 


i ' 


N.  should  be  as  equal  as  possible;  and  in  order  to  minimize  (and 


equalise)  the  amount  of  message  passing  between  the  processors,  the  number  of 
pairs  of  nodes  that  are  neighbors  in  N  but  belong  to  different  subsets  should  be 
as  small  (and  as  equal)  as  possible.  We  discuss  the  general  problem  of 


partitioning  a  graph  N  so  as  to  satisfy  these  criteria,  and  also  consider  the  par¬ 
ticular  case  of  partitioning  a  tree. 

16.  Simon  Kasif,  "Parallel  Searching  and  Merging  On  ZMOB”,  CS-TR-1405, 
CAR-TR-64,  June  1684. 

One  of  the  most  difficult  issues  that  must  be  addressed  when  studying  a  class  of 
parallel  algorithms  b  the  problem  of  choosing  a  model  that  captures  the  inherent 
difficulty  of  implementing  these  algorithms  on  a  multiprocessor  architecture. 
Shared  memory  models  have  proven  to  be  an  effective  tool  for  deriving  lower 
bounds  on  the  complexity  of  comparison  problems.  In  particular,  a  speed-up  of 
lg(P)  b  possible  for  the  problem  of  finding  an  element  in  an  AT -element  sorted 
list,  and  speed-ups  of  P  /lglg P  and  P  are  possible  for  merging  iV -element  sorted 
lists  on  P  processors  for  the  cases  of  N  ■»  P  and  P  <  N  respectively. 

In  practice,  these  speed-ups  are  not  attainable  since  the  shared  memory  model; 
ignore  many  practical  considerations  in  multiprocessor  systems,  such  as  interpro¬ 
cessor  communications,  distribution  of  data  on  local  memories  and  limited  fan¬ 
out  of  memory  locations.  In  thb  paper  we  introduce  a  model  for  parallel  compu¬ 
tation  that  b  strictly  weaker  than  the  shared  memory  modeb.  The  model  b 
based  on  an  actual  machine  currently  being  constructed  (ZMOB).  We  examine 
the  communication  facilities  available  in  the  model  and  show  that  lower  bounds 
for  merging  and  searching  on  shared  memory  modeb  are  attainable  (within  a  con¬ 
stant).  The  main  results  reported  in  the  paper  are: 

•  an  O  (lg/V  /lgP )  algorithm  for  searching  an  JV -element  sorted  list  distri¬ 
buted  on  P  processors. 

•  an  0(N  /P)  algorithm  for  merging  two  N -element  lists  on  2 P  processors. 

•  an  O  (IgfV)  algorithm  tor  merging  two  N  -element  lists  on  2 N  processors. 

•  criteria  and  techniques  for  simulating  CREW  PRAM  algorithms  on  ZMOB. 
One  oif  the  techniques  b  used  to  establish  an  O (IglgJV)  lower  bound  for  merging 
two  JV-element  Bats  on  2 N  processors. 

17.  Axriel  Rosenfeld,  "The  Prism  Machine:  An  Alternative  to  the  Pyramid", 
CS-TR-1416,  CAR-TR-70,  July  1084. 

The  prism  machine  b  a  stack  of  n  cellular  arrays,  each  of  sise  2”  X  2” .  Cell 
(t  J)  on  level  k  b  connected  to  eelb  (« J ),  (i  +  2*  J ),  and  (»  J  4-2* )  on  level 
k  + 1,  1  <  fc  <  » ,  where  the  sums  are  modulo  2” .  Such  a  machine  can  perform 
various  operations  (e.g.,  "Gaussian”  convolutions  or  least-squares  polynomial  fits) 
on  image  neighborhoods  of  power-of-2  sixes  in  every  position  in  O  (n )  time, 
unlike  a  pyramid  machine  which  can  do  thb  only  In  sampled  positions.  It  can 
also  compute  the  discrete  Fourier  transform  in  O  (n  1  time.  It  consists  of  n  -4* 
cells,  while  a  pyramid  consists  of  fewer  than  4”  +  73  cells,  but  in  practice  n 
would  be  at  most  10,  so  that  a  prism  would  be  at  most  about  seven  times  as 
large  as  a  pyramid. 


18.  LV.  Ramakrishnan  and  Shaunak  Pawagi,  “Parallel  Update  of  Minimum 
Spanning  Trees  in  Logarithmic  Time",  CS-TR-1452,  CAR-TR-97, 
November  1084. 

Parallel  algorithms  are  presented  for  updating  a  minimum  spanning  tree  when 
the  cost  of  an  edge  changes  or  when  a  new  node  is  inserted  in  the  underlying 
graph.  The  machine  model  used  is  a  parallel  random  access  machine  which 
allows  simultaneous  reads  but  prohibits  simultaneous  writes  into  the  same 
memory  location.  The  algorithms  described  in  this  paper  for  updating  a 
minimum  spanning  tree  require  O  (logn  )  time  and  O  (n  2)  processors.  These  algo¬ 
rithms  are  efficient  when  compared  to  previously  known  algorithms  for  initial 
construction  of  a  minimum  spanning  tree  that  require  0(log2n)  time  and  use 
O  (n2)  processors. 

18.  Angela  Y.  Wu,  “Parallel  Processing  of  Encoded  Bit  Strings”,  CS-TR-1455, 
CAR-TR-88,  November  1884. 

m 

Many  operations  on  strings  of  length  n  can  be  speeded  up  by  a  factor  of  p  using 
p  processors.  String  operations  can  also  be  speeded  up,  even  when  a  single  pro¬ 
cessor  is  used,  by  compactly  encoding  the  strings,  e.g.,  using  run  length  code. 
This  paper  shows  how  to  combine  these  two  approaches  by  using  p  processors  to 
process  compactly  encoded  strings. 
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Classes  of  “ribbonlike”  planar  shapes  can  be  defined  by  specifying  an  arc,  called 
the  spine  or  axis,  and  a  geometric  figure  such  as  a  disk  or  line  segment,  called  the 
generator,  that  “sweeps  out”  the  shape  by  moving  along  the  spine,  changing  size 
as  it  moves.  Shape  descriptions  of  this  type  have  been  considered  by  Blum, 
Brooks,  Brady,  and  others.  This  paper  considers  such  descriptions  from  the 
standpoints  of  both  generation  and  recovery  (i.e.  given  a  shape  generated  in  this 
way,  to  determine  the  axis  and  generation  rule  that  gave  rise  to  it),  and  discusses 
their  relative  advantages  and  disadvantages. 


